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FORMULAS FOR CALCULATING NUMBER OF FRUITS 

REQUIRED FOR ADEQUATE SAMPLE 

FOR ANALYSIS 1 

F. E. Denny 

When taking samples of variable fruits, as oranges for example, 
it is important to obtain an approximation of the number of fruits 
that should be included in the sample, in order that the results of 
the analyses shall be sufficiently accurate for the purpose of the 
investigation. It is the object of this paper to give formulas which 
may be used in such cases; to illustrate their use by numerical 
examples; to indicate the reliability that may be placed upon them; 
and to show the results that were obtained in applying them to 
the analysis of citrus fruits. 

The first step consisted in obtaining a measure of the variability 
of the fruit in question. In the case of citrus, this was accomplished 
by analyzing individual fruits, since one fruit was found to yield 
enough material for the analytical work performed. It smaller 
fruits, such as plums, were used, it would be necessary to increase 
the sample to half a dozen, or a dozen, or some other number that 
would make a convenient sample with which to work, but the results 
of the analysis of each of the chosen units should be tabulated 
separately. From these data the probable error of a single sample 
was found, and this value formed the starting point for the calcula- 
tions made in formulas described in later paragraphs. 

Variability in composition of individual oranges in 
single sample 

Fifty-one oranges were taken at random from a single tree. 
These fruits were all of good marketable quality, and were appar- 
ently free from diseases, insect injuries, and bruises. They were 
uniform in color, but of course variable in size. The fruits were 
analyzed individually and the results for each fruit tabulated 

1 Published by permission of the Secretary of Agriculture. 
Botanical Gazette, vol. 73] [44 
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separately, as given in table I. At the bottom of the table will be 
found the values for the probable error of the mean and the probable 
error of a single observation. These were calculated from the 



| Sd 2 
\n(n-i) ; 



P.E. sing. 



following formulas: P.E. mean =±0.6745 

I 2d 2 
= =*= 0.6745^17-^— r ; where "n" is the number of variates (in this 

TABLE I 

Composition of fifty-one oranges, Washington Navel variety 



Orange no. 



Degrees 


Percent- 
age of 


Percent- 
age of 




sugar 


acid 


12.80 


9 63 


0.98 


IS 


10 


10.30 


0.98 


12 


So 


9.46 


1.08 


13 


70 


IO.44 


1. 14 


14 


40 


II. 17 


1.06 


IS 


00 


11. 14 


1 .06 


13 


90 


IO.8S 


0.84 


13 


40 


IO-43 


0.98 


13 


70 


IO.94 


0-93 


13 


7° 


IO.65 


0.84 


13 


ss 


IO.71 


0.90 


13 


35 


IO. 14 


1 -15 


13 


20 


I0.35 


0.94 


13 


05 


IO.85 


0.98 


14 


3° 


IO.83 


0.96 


IS 


°5 


n-59 


0-95 


14 


go 


11.80 


1.02 


13 


20 


10.30 


1 .09 


15 


25 


12.05 


1 .00 


13 


40 


10.53 


1 .02 


14 


8.5 


11. 18 


1 . 11 


13 


40 


n-35 


1. 01 


14 


45 


11.49 


0.82 


13 


80 


10.99 


0.91 


IS 


00 


10. 20 


1. 14 


14 


45 


11.28 


1. 22 


14 


30 


11. 15 


1-05 


14 


60 


11. 61 


1. 12 


14 


55 


11 .40 


0.87 



Sol. sol. 



acid 
ratio 



Orange no. 



Degrees 
brix 



Percent- 
age of 
sugar 



Percent- 
age of 
acid 



Sol. sol. 



acid 
ratio 



1 . 
2. 

3- 

4- 
5 ■ 
6. 
7 ■ 
S. 

9- 
10. 
i] . 
12. 

13- 

14. 

IS- 

16. 

17. 
18. 

19. 
20. 
21 . 
22 . 

23 ■ 

24. 

25- 
26. 

27- 

28. 

29. 



13-05 
13-35: 

1 1 . 6oj 
12.00 
13.60 
I4-I5 
16. 55 
i3-7o 
14-75 
16.30 

15-05 
11 .60 

14-051 
14-25 
14.90 

15.85 
14.60 

12. io j 
15-25 
i3-i5j 
13-40 
13-25 
17.60 

I5-IS 
11.40 
11.85 
13.60 

13-05 
16.70 



3°- 
31 

32- 

33- 

34- 

35- 

36. 

37- 

38. 

39 

40. 

41- 
42. 

43- 
44- 
45- 
46. 

47- 
48. 
49- 
So- 
Si- 



Mean 

P.E. mean. 
P.E. sing. . 



14 



10.91 

10.93 
10.68 
11 .92 
10.83 
10.31 
11. 51 
10.96 

H-S7 
11.46 
10. 
10.18 

10.33 
10.09 
11.00 
10.05 
10.76 
11 .26 
H-44 
H-35 
10.88 
11. 13 



1 .07 
0.96 
1. 14 

i-i5 
1.06 
1 .02 



1 .04 
1. 29 
1.23 
0.91 
1 .29 
1. 19 
1.24 
0.94 

1 -IS 
1 .27 
0.86 
0.98 
1.07 

131 

1. 16 



12.80 
14.60 
12.00 
1330 
13-05 
12-95 
17.10 
14. 20 
n.85 
12.4S 
15.10 
10.40 
11 .20 
10.8s 
I5-3S 
11. IS 
11 .00 
17. 10 
15.20 
11.80 
11.30 
12.15 



07 



=°s 



=0.06 



= 0.4 



I 05 



=0.09 



13.60 



=0. 17 



= i-3 



case fifty-one), and 2d 2 is the sum of the squares of the deviations 
of each measurement from the mean. For example, in the column 
under brix, table I, "d" is the deviation of 12.80 from 14.00, etc. 
The probable error of a single sample and the probable error of 
the mean are connected in the following manner: P.E. mean = 

P.E. sing. 

— — '-1= — : , so that after a value for P.E. sing, has been found, the 
In 
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value of P.E. mean for any desired number of fruits may be calcu- 
lated by substituting' this number for "n" in the formula. Thus 
if P.E. sing, has been found to be 0.5, P.E. mean for a sample of 

o. 5 
twenty-five fruits is . — =0.1. 



V 



25 



The values in table II, giving the odds, may be utilized under 
the two following conditions. In the first place, it may be used 
in connection with the analytical results obtained from a single lot 
of fruit to estimate the degree of assurance that an accuracy 
between certain limits has been attained. For example, the average 
sugar content (in table I) was 10.89. If a second sample of fifty-one 
fruits had been taken at the same time and under the same condi- 
tions, we would probably not have obtained exactly this value. 

TABLE II* 

Table of odds 



Coefficient 


Odds 


Coefficient 


Odds 




1 . 00 to 1 

2. 21 to 1 

4 . 64 to I 

9.89 to I 

15-95 to I 

22.26 to I 

31.36 to I 


3 

3 
3 

4 
4 
4 
4 




44.87 to 1 
64.79 to 1 
95.15 to 1 
142.26 to I 
215.92 to I 
332-33 to I 
519.83 to I 


1.5 


6 




8 


2-5 

2.8 




2 




3.2 


6 







* The values in this table were selected from a table by Pearl and Miner 
(6). Original article should be consulted for a complete list of values. 

But the P.E. mean, ±0.06, indicates that the chances are even 
(1 to 1) that the value found would have been between 10.95 an d 
10.83. I n addition to this information, table II shows that the 
chances are 9.89 to 1 that the value would have been between 10.89 
plus (2.5X0.06) and 10.89 minus (2.5X0.06), that is, between 
11.04 and 10.74. 

Considering the probable error of a single sample in connection 
with table II, the P.E. sing, was found to be 0.4. This means that 
if one more fruit had been taken, the chances are even that its 
value would have been between 10.89+0.4, and 10.89—0.4. In 
other words, half the fruits in table I should have sugar values 
between 11.29 an d IO -49> an( l half should be outside these limits. 
Table I shows that twenty-four oranges are within these limits and 
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twenty-seven outside. Table II indicates further that the chances 

are 4.64 to 1 that no single sample would deviate from 10.89 D Y as 

much or more than 2.0. times 0.4: that is, of the fifty-one fruits in 

table I, about nine should be outside the limits 11.69 to 10.09, and 

forty-two should be within them. A count shows that in this case 

five are outside and forty-six within. 

In the second place, table II may be applied in an entirely 

different case, namely, when comparing the analytical results from 

two different lots of fruit in order to estimate the degree of assurance 

that the difference shown between them is significant. For 

example, in table VII it is shown that the refractive index of the 

juice of the Eureka strain of lemons was 44.6= fc o.2, while that of the 

Shade Tree strain was 45.7=^0.3. The difference is 1.1. What are 

the chances that this difference is significant and not due merely 

to a sampling error ? This calculation is made from the following 

, , difference 1.1 1.1 _,, 

formula: ^-^ — T^a = , \ , - = — 2 = 3-° • * ne 

r.L. of difference V (o.2) 2 + (o.3) 2 °-3o 

figure 3.0 is here termed the coefficient of odds, and its value is 

sought in column 1 in table II, from which it appears that the odds 

are about 22 to 1 (judging from these data, at least) that the juice 

of lemons from the Shade Tree strain is higher with respect to 

refractive index. Table II applies only in those cases in which the 

difference between two results may be expected to occur in either 

direction. For a table showing odds when it is known that the 

difference between two results will be in one direction only, see 

Wood (ii, p. 26). 

Formulas for calculating number of fruits for sample 

Two general sets of conditions may be recognized under which 
samples are collected for analysis: (1) When samples are taken 
from each of two or more different lots of fruit, with the object of 
later comparing them, to determine whether the differences between 
them are significant, and what the odds are that this is so. 
(2) When a sample is taken from a single lot of fruit for the pur- 
pose of obtaining a figure that will represent the composition of 
that lot, and to attain a certain assurance that this figure is cor- 
rect within certain desired limits. 
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Haynes and Jxjdd (3) have studied the requirements under the 

first condition. They proposed the following formula for use in 

calculating the number of individuals to include in a sample in 

order that a certain difference between two averages may be 

/sXpV 2 
considered significant: N = 2( ) . N is the "number of 

samples which must be taken in order that there may be a proba- 
bility of 0.957 2 that a 5 per cent difference is significant"; 3 is the 
coefficient in the "table of odds" (table II), and thus is equivalent 
to odds of 22 to 1 ; "p" is the probable error of a single sample and 
must be determined experimentally (in this case by analyzing 
individual fruits). 

Other values than 3 and 5 may be assumed to meet the condi- 
tions of the experiment; therefore, in order to make comparisons 
with what is to follow, it is desired to express the preceding formula 

/coefficient of odds XP.E. singA 2 
m general terms as follows : N = 2 1 -tt~ I 

(formula 1). To illustrate the use of this formula, data may be 
taken from Haynes and Judd's paper. Working with apples, 
they found the mean titration value to be 10.20 with a P.E. sing, 
of 0.78, and the latter is thus 7.7 per cent of the mean. To get an 
assurance of 30 to 1 that a 5 per cent difference is significant: 

N = 2 ( 3 " 2X7-7 ) 2 = 49 apples. 

The problem under the second condition may now be considered. 
We wish a general formula that will connect the number in the 
sample with the probable error of a single fruit and with the 
coefficients in the " table of odds " (table II) . In table I it was shown 
that the mean sugar content was 1089. =±=0.06. What are the 

chances that the "true" value is within the limits ±0.17? The 

0.17 
chances are found in the following way (Merriman 5) : — 7 = 2.8, 

and looking up the coefficient 2.8 in table II, we find the 
chances are about 16 to 1 that the error in. 10.89 i s n °t more 
than ±0.17. 

2 The expression 0.957 may be thought of as indicating a probability of 957 out of 
1000, which represents a ratio of 957 to 43, or about 22 to 1. 
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This relation may now be expressed in general terms by putting 
"deviation" for ±0.17, where it is to be the deviation above 
or below the mean, which we wish to use as a limit for accu- 
racy; then putting "P.E. mean" for 0.06, and "coefficient of 

, . , deviation . 

odds for 2.8, we have: ^^ = coefficient of odds, but 

'_ . P.E. mean ' 

P.E. sing. 
P.E. mean = — — '-=- — ' (Wood ii), and substituting this value, 

N deviation 
the equation becomes =— = — : — = coefficient of odds, from which 
P.E. smg. 

VN 
/coefficient of odds XP.E. singA 2 . , , . 

N = l d^iatl^n ) {formula 2 ). 

In illustration of the use of this formula, table VI shows that fifty 
grapefruits from tree no. 1 had an average brix of 13.15 and the 
P.E. sing, was 0.35. What number of fruits are required to give 
odds of 10 to 1 that the brix of that number will be correct to 
±0.15? Table II shows that for odds of 10 to 1, the coefficient 

of odds is 2.5, therefore N = ( — : — J =thirty-four grapefruits. 

No account is taken of errors in the method of analysis, since in 
the present case analytical errors are small as compared with the 
variability of the individual fruits with respect to the constituent. 
If it is desired to take analytical errors into account also, see 
Waynick (10) and Robinson and Lloyd (7). 

Comparison of formulas 

Although formulas 1 and 2 appear to be very similar, the first 
in fact giving values just double those of the second, certain essential 
differences should be pointed out. Formula 1 applies when two 
(liferent lots are being compared, in which case the significance of 
the difference between them is affected by the sampling error of 
each lot. Formula 2 applies to the analytical results of a single 
lot only, its own error being the only one involved. Such a condi- 
tion arises when an analysis is made for the purpose of reporting 
the composition of a product with respect to a certain constituent, 
or when an analysis is made to determine whether a constituent 
has reached a certain required value. 
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Accuracy of formulas 

In the preceding paragraphs it was found that the use of formulas 
i and 2 gave forty-nine fruits as the required number in one illustra- 
tive case, and thirty-four as the required number under the other 
set of conditions. We should not be justified, however, in conclud- 
ing from this test that forty-seven would be too few in the first 
case, and thirty-six would be more than enough in the second. 
With either formula it is seen that the number N depends for its 
value upon the value of the probable error of a single sample, and 
therefore it becomes necessary to inquire how variable this value is, 
and what effect changes in its value have upon N. 

TABLE III 

Different values obtainable from same lot of fruit 



Calculations after 

the following number 

of fruits analyzed 



Taken in order of 
analysis 



P.E. sing, 
found 



No. of fruits 
required 



Taken in order rearranged by lot 



First rearrangement Second rearrangement 



P.E. sing, 
found 



No. of fruit; 
requited 



P.E. sing, 
found 



No. of fruits 
required 



10 

IS 

20 

25 

30 

35 

40 

45 
5i 



1 .0 

1 . 1 

1 . 1 
1 . 1 
1. 1 
1.2 
i-3 



18 
22 
22 
22 
22 
26 
3i 



1.4 
1-3 

1-3 



36 

36 
36 
36 
36 
36 
36 
3i 
3i 



18 
26 
26 
26 
36 
36 
31 
3i 
31 



It is instructive to note what values would have been obtained 
if the value of P.E. sing, had been taken, not after fifty-one fruits 
had been analyzed, but after the analysis of say ten fruits, or after 
fifteen, or twenty-five. The different values for P.E. sing, and N 
that were obtainable in this manner calculated from formula 1 are 
shown in table III. It is thus found the P.E. sing, varied from 1.0 
to 1.3, which values, substituted in the formula, caused the value 
of N to vary from 18 to 31. Formula 2 would likewise have given 
variable values, but the actual figures would have been one-half 
as large. 

The fruits in table I were analyzed in the order of size, number 
one being the largest. It may be urged that therefore we do not 



1922] 



DENNY— FRUITS 



Si 



have a true random sample, or that there is a correlation between 
size and composition. The correlation coefficient between size and 
the soluble-solids-acid ratio, however, was calculated by the method 
recommended by Toixey (9), and was found to be 0.158, with a 
probable error of 0.092, which does not indicate any significant 
correlation. 

In order to partially eliminate the size of the fruit as a factor, 
the order in table I was rearranged by lot. With the new order, 

TABLE IV 

Results of calculations of probable error based ox 

analysis of groups of ten fruits each 



Groups of io fruits each 



Group 1 

2 
3 

4 
S 

6 

7 

8 

9 
10 
11 
12 
13 
14 
IS 



Solids-acid ratio 




No 


required 


P.E. sing. 


for desired 




assurance 


1.0 




18 





9 




IS 


1 


3 




31 


I 







18 


I 


6 




47 


1 


4 




36 


I 


3 




31 


I 


3 




31 


1 


3 




31 





5 




S 


I 


8 




59 


1 


3 




31 


1 


1 




22 





9 




iS 


I 


2 




26 



the values of P.E. sing, and N were calculated after ten fruits 
were analyzed, after fifteen, etc. The results are shown in 
table III. P.E. sing, was found to vary from 1.3 to 1.4, causing 
N to vary from 31 to 36. Another rearrangement by lot is shown in 
the last two columns of table III. Values of P.E. sing, vary from 
1.0 to 1.4, causing N to change from 18 to 36. In both these cases, 
values by formula 2 would also have been variable, but of course 
would have been just half as large numerically. 

Use of small numbers to calculate probable error of 
single fruit 

It may be inquired what the P.E. sing, would have been for 
different lots of ten fruits each. Groups of ten each were selected 
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by lot and the values of P.E. sing, and N calculated. Strictly 
speaking, when the number involved is small, say ten, the formula 
for P.E. only gives approximate results (Brunt i). The value of 
P.E. sing, for the ratio is thus shown to vary from 0.5 in group 10 
to 1.8 in group n, causing a change in N from 5 to 59 (table IV). 
One trial with a small number of fruits would not be adequate for 
the determination of the value of P.E. sing, and of N, at least with 
such variable material as oranges. 

Probable error of a probable error 

The preceding discussion indicates that variable values were 
found for N, depending on the value found for P.E. sing. To obtain 
an idea of the variability of P.E. sing, and of N in the manner 
described (that is, by obtaining the results given by several different 
groups containing different numbers) is tedious and unsatisfactory. 
A more convenient method of judging the accuracy of P.E. sing, 
and N is desired. It is plain that the probable error calculated 
from the analysis of fifty fruits is more representative of the lot 
than that calculated from ten fruits. The relation of the error in 
the probable error to the number of fruits analyzed is given by the 
expression (Brunt i, p. 57): Probable error of P.E. sing. = P.E. 

sing. X^= (formula 3). Thus if 1.3 is the P.E. sing, for 
V n— 1 

the soluble-solids-acid ratio (table I), then the probable error 

of 1.3 = i-3 X / =0.09, or about 0.1. In other words, the 

V 51-1 

"true" value of P.E. sing, is probably between 1.2 and 1.4. We 

may obtain an estimate of the limits of N by substituting 1.2 and 

1.4 successively in the formulas; in this case N is found to be 26 

or 36 for formula 1, and 13 or 18 for formula 2. 

Ordinarily it will be sufficient to consider the probable limits 

of the value of N by approximations made by the use of formula 3 

in the manner indicated. If it is found desirable to do so, however, 

a formula may be used for the correction. If we rearrange formula 

/ coefficient \ 2 
1 to read: N = 2( -tt~ ) (P.E. sing.) 2 , and apply the method 

described by Goodwin (2), we find that deviation produced in the 
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value of N by an error in the value of P.E. sing, is as follows: 

, T /coefficient\ 2 ,,, „ . , 1 

n . . . M 4Kdiffe7e^) ^ ^ 2 \ . ^ 

Deviation ui N = jt^tf — ; — \ X error m P.E. 

d(P.E. sing.) 

( coefficient \ 2 
difference / xRE - sin &- Xerror in P.E. sing, (formula 4). 

To apply this formula to a particular case, we find from 
table III that the P.E. sing, for fifteen fruits was 1.1; the 
error in 1.1 is found by substituting in formula 3 to be 

047 6 9 

1.1X,/ = 0.14. If we wish odds of 22 to 1 for a difference of 

V15-1 

i.oin ratio, we obtain, by substitution in formula 4: Deviation in 

/3.0 V 
N = 4X( — ) Xi.iXo.i4 = six fruits, therefore the corresponding 

value, 22, found in table III, is in error by six fruits, and the. 

probable number extends from 16 to 28. 

The corresponding formula for applying a correction to for- 

( coefficient \ 2 
mula2is: Deviation inN = 2Xt-7r^ ) X P.E. sing. X deviation 

in P.E. sing, (formula 5). 

Data on other lots of oranges 

The discussion thus far has related to the data from only one 
lot of oranges from a single tree. Fruits from four other trees were 
obtained and analyzed in the same manner. The number of fruits 
used was small, but some idea of the accuracy of the probable 
errors can be obtained by applying formula 3. The data are shown 
in table V, and serve to indicate values of P.E. sing, that may be 
expected in dealing with different lots of oranges. 

Data on grapefruit 

Fifty fruits were taken at random from a grapefruit tree in one 
grove, and a corresponding number from another tree located in 
another grove. The fruits were analyzed individually and the mean 
and P.E. sing, determined. To save space, the complete analyses 
are not given, but the results are summarized in table VI. From 
this table it is seen that P.E. sing, of the fruit from the two lots is 
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approximately the same with respect to brix and sugar, but P.E. 
sing, for acid and for ratio is considerably different in the two lots. 

Data on lemons 

That different lots of fruit show different values for P.E. sing, is 
also apparent from the analysis of individual lemons. In table VII 
will be found the results of the analysis of thirty lemons from 
two different lemon trees, each tree representing a different strain 

TABLE V 
Showing different values of P.E. sing, with different lots of oranges 



Tree no. 


No. of 

FRUITS IN 
SAMPLE 


Degrees brix 


Percentage acid 


Sol. sol. ,. 

r-r- ratio 

acid 




Mean 


P.E. sing. 


Mean 


P.E. sing. 


Mean 


P.E. sing. 


2 


12 

13 
12 

9 


I3-70 
15.00 
11.80 
12.45 


O.S 
0.5 
0.4 

0.3 


0.87 
0.86 

°-79 
1.46 


0.07 
0.04 
0.08 

O.IO 


15-9 

17-4 

15.2 

8.6 






0.8 




1-3 
0.7 


5 





TABLE VI 

Comparison of composition of fruit from two grapefruit trees 



Tree no. 


Total 

NO. OF 
FRUITS 


Brix 


Percentage 
Sugar 


Percentage 
acid 


SOLIDS-ACID 
RATIO 


Mean 


P.E. 

sing. 


Mean 


P.E. 
sing. 


Mean 


P.E. 
sing. 


Mean 


P.E. 

sing. 




5° 
50 


13-15 
12.30 


0-35 
0.35 


8.16 
7.89 


O.27 
O.29 


2.29 
1.65 


O.OI 
O.09 


5-8 
7-5 




2 


0.4 





of the Eureka variety. While too much reliance cannot be placed 
on the values obtained by analyzing fifteen fruits, it is seen from 
the table that the two lots of fruit probably have different values 
of P.E. sing, with respect to three of the characters of which 
analytical results were obtained. 

Further precautions regarding use of formulas 

Two further precautions may now be added regarding the use 
of the formulas. When the value of P.E. sing, has been found for 
one tree or lot of fruit, it must not be assumed that another tree 
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or lot will have the same value (compare acidity of two grapefruit 
trees, table VI). When two trees or lots of fruit are found to have 
the same value for P.E. sing with respect to one constituent, it must 
not be assumed that they agree also with respect to other constitu- 
ents (compare trees no. 1 and no. 2, table VI, with respect to 

brix and ratio) . 

TABLE VII 

Variability in composition of individual lemons, Eureka variety 



Eureka strain* 



Shade tree strain* 



Lemon no. 



Sp. Gr. 
of fruit 



Percent- 
age 

rind 



Refrac. 

index 

of juice 



Acidity 
of juice 

cc. 
NaOH 



Lemon no. 



Sp. Gr. 
of fruit 



Percent- 
age 

rind 



Refrac. 

index 

of juice 



Acidity 
of juice 

cc. 
NaOH 



I . 

2 . 

3- 
4- 
5- 

6. 

7- 
8. 

9- 
10. 

11 . 

12 . 
13- 
14. 
IS- 



0.92 
0.96 
0.94 
0.96 

°-9S 
0.94 



96 
95 
94 
96 

95 
95 
96 
96 
0.97 



40 
40 
5° 
49 
49 
46 
48 

35 
46 

5° 
So 
48 
57 
39 
49 



42. 
44 
44 
44 
45 
44 
45 
44 
43 
46 

45 

45 

43-8 

43-9 

45-4 



27.2 
28.5 
28.8 
29.7 
27.9 
3°-7 
30. S 
28.7 
28.1 

25-1 
27.8 
28.0 

29-3 
28.0 
28.7 



1 . 

2. 
3- 
4- 
5- 
6. 

7- 
8. 

9- 

10. 
11 . 

12. 

13- 
14- 

IS- 



0.96 

0.94 

0.96 

0.98 

0.98 

o. 

o. 

o. 

o. 

o. 

o. 

o. 

o. 

o. 

o 



97 



41 
59 
So 
36 

47 
62 
56 
54 
47 
54 
48 
39 
5i 
54 
59 



Mean. . . . 

P.E. mean 
P.E. sing. . 



o.95 



=0.002 
=0.007 



46 

= 1.0 
=4.0 



44.6 

="=0. 2 
±0.6 



Mean. 



=0.2 
= 0.9 



P.E. mean 
P.E. sing. 



0.97 

=0.003 
=0.010 



5° 

= i-3 
= 5° 



45 



24-3 
24.4 
24.9 
26.8 
21.8 
24.0 

24- S 
22.9 
22.0 
20.8 
26.6 
26.3 
22.6 
20.9 
22.0 



23-7 

±=0.4 
^1-3 



* Strains described by Shamel, Scott, and Pomeroy (8). 



Comparison of standard formula with Peter's formula for 
calculating probable error of single observation 

Two general methods for calculating the value of P.E. sing, 
are as follows : 



Standard formula 



Peter's formula 



1 2d 2 Sd 

P.E. sing. = ±0.6745 \—. l P.E. sing. = ± 0.8453 v n ( n _ x ) 

Thus, to use the standard formula, the sum of the squares of 
the deviations must be found, while with Peter's formula only the 
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sum of the deviations (taken without regard to sign) is needed. In- 
asmuch as the latter method is more convenient, it seemed profitable 
to show the difference in the value of P.E. sing, given by the two 
methods. In table VIII are shown the comparative values found. 3 
It is seen that the difference in the value of P.E. sing, by the two 
methods is at least not more than is shown between two groups of 
even the same lot of fruit. Hence no large error would have been 
introduced by the use of the more convenient Peter's formula. 



TABLE VIII 

Comparison of standard formula with Peter's formula for calculating 
probable error of single observation 



NO. OF FRUITS IN 
SAMPLE 



IO 
IS 
25 
3° 

4° 
45 
5i 



P.E. SING. OBSERVATION 



Solids-acid ratio 



Standard 
formula 



I.09 
I. OI 
I.09 
I . IO 
1 .09 
1.20 
I.26 



Peter's 
formula 



I.08 



I. IO 

1 -13 
1.02 
1. 22 
1.29 



NO. OF FRUITS IN 
SAMPLE 



IO 

15 

20 

25 

35 
45 
Si 



P.E. SING. OBSERVATION 



Percentage sugar 



Standard 
formula 



0-39 
0-34 
0.44 
0.42 
0.40 
0.40 
o-39 



Peter's 
formula 



0.40 

0-34 
0.42 

°-43 
0.40 
0.40 
0.38 



Summary 

1. Formulas are given, for use under two different conditions of 
sampling, to determine the number of fruits required in a sample 
in order to give a desired assurance that a certain accuracy has 
been attained. 

2. Approximately 250 fruits of oranges, lemons, and grapefruit 
were analyzed individually, and the probable errors calculated - 
The data so obtained were applied to the formulas, and numerical 
examples worked out to illustrate their use. 

3. It is shown that the values given by the formulas are only 
approximately correct. The sources of error are discussed, and 
formulas given by which the amount of this inaccuracy may be 
estimated under different conditions. 



3 Computations are made much easier by the use of tables given by Mellor (4). 
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4. Analyses of fruits taken from different orange, lemon, and 
grapefruit trees are given, showing the variability of the fruits of 
different trees with respect to brix of juice, percentage of sugar, 
acidity, etc., and the values of the probable errors that such 
variability produced. 

The writer wishes to express appreciation to Mr. E. M. Chace 
and Mr. C. G. Church for cooperation in obtaining the analytical 
data and for criticism of the manuscript. 

United States Department or Agriculture 
Laboratory of Fruit and Vegetable Chemistry 
Los Angeles, Cal. 
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